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A random triadic process 


Daniel Korandi * Yuval Peled 1 Benny Sudakov 1 


Abstract 

Given a random 3-uniform hypergraph H = H{n,p) on n vertices where each triple indepen¬ 
dently appears with probability p, consider the following graph process. We start with the star Go 
on the same vertex set, containing all the edges incident to some vertex vq, and repeatedly add an 
edge xy if there is a vertex z such that xz and zy are already in the graph and xzy S H. We say 
that the process propagates if it reaches the complete graph before it terminates. In this paper 
we prove that the threshold probability for propagation is p = 2 ^- We conclude that p = 
is an upper bound for the threshold probability that a random 2-dimensional simplicial complex 
is simply connected. 


1 Introduction 

The principle of triadic closure is an important concept in social network theory (see e.g. m- 
Roughly speaking, it says that when new friendships are formed in a social network, it is more likely 
to occur between two people sharing a common friend, thus “closing” a triangle, than elsewhere. 
We will consider a simplistic model of the evolution of a social network, where friendships can only 
be formed through a common friend, and triadic closure eventually occurs at any triangle with 
probability p, independently of other triangles. We refer to this process as the triadic process. 

Formally, let H = H{n,p) be a random 3-uniform hypergraph on [n] where each triple inde¬ 
pendently appears with probability p. The triadic process is the following graph process. We start 
with the star Gq on the same vertex set [n], containing all the edges incident to some vertex vq, and 
repeatedly add any edge xy if there is a vertex z such that xz and zy are already in the graph and 
xzy E H. We say that the process propagates if all the edges are added to the graph eventually. It 
is easy to see that this event does not depend on the order the edges are added in. In this paper we 
prove that the threshold probability for propagation is 

Theorem 1.1. Suppose p = for some constant c > 0. Then, 

1. If c> then the triadic process propagates whp. 

2. If c < I, then the triadic process stops at 0{ny/n) edges whp. 
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As usual, we say that some property holds with high probability or whp if it holds with probability 
tending to 1 as n tends to infinity. 

Randomized graph processes have been intensively studied in the past decades. One notable 
example is the triangle-free process, originally motivated by the study of the Ramsey number R{3, n) 
(see e.g. [6]). In this process the edges are added one by one at random as long as they do not create 
a triangle in the graph. The triadic process is a slight variant of this, with a very similar nature. 
Indeed, our analysis makes good use of the tools developed by Bohman [2] when he applied the 
differential equation method to track the triangle-free process. Several other related processes were 
also analyzed using differential equations, e.g. [3]. For more information about this method we refer 
the interested reader to the excellent survey of Wormald m- 

Coja-Oghlan, Onsjo and Watanabe [3] investigated a similar kind of closure while analyzing con¬ 
nectivity properties of random hypergraphs. They say that a 3-hypergraph is propagation connected 
if its vertices can be ordered in some way ui,..., so that each Vi (i > 3) forms a hyperedge with 
two preceding vertices. They obtain the threshold probability for the propagation connectivity of 
H{n,p) up to a small multiplicative constant. Using this directed notion of connectivity, our problem 
asks when the random 3-hypergraph on the line graph of Kn is propagation connected from the star. 

Our main motivation for considering the triadic process comes from the theory of random 2- 
dimensional simplicial complexes. A simplicial 2-complex on the vertex set U is a set family Y C 
closed under taking subsets. The dimension of a simplex cj G T is defined to be |cr| — 1. We use the 
terms vertices, edges and faces for 0, 1 and 2-dimensional simplices, respectively. The 1-skeleton of 
a 2-complex is the subcomplex containing its vertices and edges. 

The Linial-Meshulam model of random simplicial complexes, introduced in [9], is a generalization 
of the Erdos-Renyi random graph model and has been studied extensively in recent years. The 
random 2-complex Y 2 {n,p) is defined to have the complete 1-skeleton, i.e., all vertices and edges, 
and each of the faces independently with probability p. The study of random complexes involves both 
topological invariants and combinatorial properties, including homology groups, homotopy groups, 
collapsibility, embeddability and spectral properties. 

One of the oldest questions of this kind, asked by Linial and Meshulam [9], is how the fundamental 
group 'Ki{Y 2 {n,p)) of the random 2-complex behaves. Babson, Hoffman and Kahle [I] showed that 
if p < for some arbitrary a > 1/2 then the fundamental group is nontrivial whp. On the other 
hand, they proved that TTi{Y 2 {n,p)) is trivial for p > y41ogn/n, which means that the threshold 
probability for being simply connected should be close to As a corollary of the first part of 

Theorem ll.il we improve the upper bound on the threshold probability. 

Corollary 1.2. Let p = for some constant c > ^. Then Y 2 {n,p) is simply connected whp. 

Proof. Suppose we have a 2-complex C such that one of its edges, e, is contained in a unique face 
/. Then we can collapse f onto the other two edges without changing the fundamental group of 
C. In fact, C — f — e is homotopy equivalent to C, the former complex being a deformation retract 
of the latter. We say that a 2-complex with complete 1-skeleton is a collapsible hypertree if we can 
apply a sequence of collapses to it and end up with a tree. Clearly, a collapsible hypertree has trivial 
fundamental group. 
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Now observe that Theorem 11.11 implies that ¥ 2 ( 11 ,p) contains a collapsible hypertree whp. Indeed, 
if the process propagates, then take C to be the subcomplex of the faces that correspond to the triples 
we used to add edges to the graph. Then by definition, the reverse of the triadic process on C is 
exactly a sequence of collapses resulting in a star. 

Basic results about the topology of complexes tell us that the addition of faces to a simply 
connected complex does not change the fundamental group, hence TTi(Y 2 {n,p)) is trivial whp. □ 

1.1 Proof outline 

Instead of exposing all the triples at once, we will be sampling them on the fly, trying to extend 
the edge set of the graph. Both the proofs of the upper bound and the lower bound consist of 
two phases. In the first phase we make one step at a time: we choose, uniformly at random, one 
(yet unsampled) triple spanning exactly two edges and expose it. With probability p the triple is 
selected, hence we can add the third edge to our edge set. The second phase proceeds in rounds: 
we simultaneously expose all the unsampled triples spanning two edges, and extend the edge set 
according to the outcome. 

The essence of the proof is to track the behavior of certain variables throughout the process. 
As we will see, this is not a very hard task to do in the second phase, using standard measure 
concentration inequalities. However, during the initial phase of the process, the codegrees (one of 
the variables we track) are not concentrated, which forces us to do a more careful analysis of the 
beginning of the process. For this we will use the differential equation method. 

We organize the rest of the paper as follows. In Section [2] we give an overview of how we 
apply the differential equation method. A detailed analysis of the actual implementation follows in 
Section [3l We move on to the second phase of the process in Section [H thereby completing the proof 
of Theorem 11.11 We finish the paper with some further remarks in Section [5l 

Notations: Throughout the paper, we will omit floor and ceiling signs whenever they are not 
necessary. The sign ± will be used to represent both a two-element set of values and a whole 
interval, but it should be clear from the context which one is the case. 

2 The differential equation method 

At any point in the process, we say that a vertex triple {u, v, tc} is open if it spans exactly two edges 
but has not yet been sampled. We will also use the notation uvw for an open triple with edges uv 
and vw. By an open triple at u, we mean a triple uvw, i.e., one that has its missing edge adjacent 
to the vertex u. 

In each step, our process picks an open triple uniformly at random and samples it. If the answer 
is positive then we close the triple by adding the missing edge to the graph. To analyze this process 
we apply the differential equation method, using some ideas from [2]. 

For simplicity, let us denote the graph we obtain after i samples by Gt. We consider the following 
random variables: Dy{i) is the degree of the vertex v in Gt. Fy{i) is the number of open triples at 
V, so it is the number of ways for v to gain a new incident edge in Gj+i. Xy^y{i) is the codegree of u 
and V, i.e., the number of common neighbors of u and v in Gj. 
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To provide some insight, we first heuristically describe the process. Let us assume for now that 
the Dy(i) are concentrated around some value D{i), and similarly the Fy{i) are approximately equal 
to some value F{i). We further assume that the variables are very close to their expectations. 

In step i+l we choose an open triple uniformly at random, so each triple is chosen with probability 
^ ^ then sample it. With probability p the sample is successful, hence we can close 

the triple. As the number of open triples at a vertex v is about F{i), the change in the degree of a 
vertex v we expect to see is 

n 

Now let us see how Fy{i) is affected by a step. We gain open triples at v either if we successfully 
sample one of them (adding the edge vw), in which case new open triples are formed with the 
neighbors of w, or if we successfully sample a triple at some neighbor of v. On the other hand, we 
lose the sampled triple regardless of the outcome. The probability of sampling an open triple at some 
specific vertex w is so assuming all the codegrees are negligible compared to D{i), the 

expected change is 

F(i + l)-F(z)«-(2pT»(i)-l). 
n 

To smooth out this discrete process, we introduce a continuous variable t and say that step i 
corresponds to time t = ti = Let us also rescale D and F by considering the smooth functions d 
and / in t, where we want d{t) to be approximately D{i)/y/n and f{t) to be approximately F{i)/n. 
Note that, since p = cj our assumptions so far suggest the following behavior: 


dPt) 


d{t + 1/n^) — d{t) 

X/rF 


n3/2(L»(i + l)-L>(i))«2c 


and 

/'(t) ~ /(t + l/nV/(t) ^ ^ ^ ^ 

1/n^ 

Let us emphasize that this little musing that we are presenting here is not a proof at all — a 
detailed analysis and the proof of concentration will follow in Section [SJ However, it at least indicates 
why it is plausible to believe that the actual values of Dy(i) and Fy{i) follow the trajectories of d 
and / given by the system of differential equations d' (t) = 2c and f'{t) = 4:cd{t) — 2. 


In the previous paragraphs we made the assumption that the codegrees are negligible compared 
to the degrees, but since they are not concentrated, proving this still needs some thought. To this 
end, we introduce two more random variables. Y^Pi) denotes the number of open 3-walks uww'v 
from u to V, i.e., 3-walks where we require that uww' be open (but allowing w = u), and Z^Pi) 
is the number of open 4-walks uww'w"v (again, allowing vertex repetitions), where both uww' and 
w'w"v are open. Note that is not symmetric in u and v. 

The point is that Yu,y and Zy^y are concentrated (as we will see in Section [3]), and — amazingly 
enough — their one-step behavior can be described with fairly simple formulas. So let us continue 
with our thought experiment and assume that all Yy^i) « Y{i), all Zy^i) ^ Z{i), and all variables 
are close to their expectations. 
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First of all, the increase in the codegrees comes from a successful sample in a 3-walk, so we expect 


“ 1 “ 1 ) 


2p{Yu,v{i) + y»,^(i)) 
nF{i) 


4cj/(t) 

n2/W 


This will be enough to prove a uniform O(logn) upper bound over all the codegrees, so we can keep 
ignoring the effect of X in the next few paragraphs. 

Let us look at the change in There are three different ways a new open 3-walk uww'v can 

appear after step f -|- 1, depending on which one of uw, ww' and w'v is the new edge. When uw is the 
new edge, there is a 4-walk utww'v in Gi where utw is open. We can count such configurations by 
first choosing w' as a neighbor of v and then choosing an open 3-walk utww'. Note that for any such 
choice, uww' will be an open triple in Gj+i, except if w' is the same as u, or a common neighbor of 
u and w. The latter cases are negligible, so there are about Y{i)D{i) possibilities in this case. 

Similarly, when ww' is the new edge, new 3-walks come from 4-walks uwtw'v, and we can count 
the number of options by first choosing w as a neighbor of u and then an open 3-walk from w to 
V. Again, the triple uww' will be open in Gj+i if w' is neither u, nor a common neighbor of u and 
V, so we find Y[i)D{i) possibilities of this type. Finally, w'v can only be the new edge if w'tv was 
successfully sampled in some open 4-walk uww'tv, so there are about Z{i) such options. 

On the other hand, we lose an open 3-walk if we sample its open triple, whether or not the sample 
is successful. As any particular triple is chosen with probability about this means that we 

expect to see 

Y{i + 1) - T(z) PS (p(2T(i)I)(f) + Z{i)) - Y(i)) . 

The change in Z(i) is a bit easier to analyze: Once again, we obtain a new 4-walk uww'w"v if 
one of its edges is added in step f -|- 1. We will assume it is the first edge, uw, but by symmetry 
our counting argument works for all other edges, as well. Then the 4-walk comes from a 5-walk 
utww'w"v in Gi- We can count the number of options by first taking an open triple vw"w' at v and 
then choosing an open 3-walk from u to w'. Again, the created 4-walk will automatically be open 
unless w' is u or a neighbor of u, so there are about Y candidates of this type and 4y {i)F{i) in 
total. And then of course, we lose an open 4-walk if we sample one of its two open triples, regardless 
of the outcome. This suggests 

Z{i + 1) - Z{i) PS ^ (4py(f)F(f) - 2Zii)) . 


Once again, we are looking for smooth functions y and z such that y{t) is approximately Y{i)ly/n 
and z{t) is about Z{i)jn. Then the same computation as before gives the differential equations 

y'W = - l)y{t) + cz{t)) 

and 

= J^{‘^cy{t)f{t) -z{t)). 


We have yet to talk about the initial conditions of the above system of differential equations. 
Our process starts with a star centered at some vertex uq, i.e., an n-vertex graph with n — 1 edges. 
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all of them touching vq. Then D„(0) = 1, F^(0) = n — 2, yn,i)(0) = 0 and Zu,viO) = n — 3 for 
any two vertices u and v other than vq. For convenience, we will drop the center of the star from 
consideration in the sense that we do not define the variables with vq among the indices. This is a 
technicality that allows us to prove concentration, and since our recurrence relations never use those 
variables, it causes no problem. 

Hence we obtain the initial conditions d{0) = 0, /(O) = 1, 7/(0) = 0 and 2;(0) = 1, and an easy 
calculation shows that the corresponding solution of our system of differential equations is 

d{t) = 2ct f{t) = 1 — 2t + 

y{t) = d{t)f{t) z{t) = p{t). 

In the next section we prove that the variables indeed closely follow the paths defined by these 
functions. 

3 Calculations 

In this section we show that our variables follow the prescribed trajectories up to some time T. Of 
course, we cannot hope to do so if f{t) vanishes somewhere on [0,T], as that would mean that the 
process is expected to die before time T. Now if c > 1/2 then / has no positive root, so this is not an 
issue: we can take T = ^/k)gn. However, if c < 1/2 then / does reach 0, first at time Tq = 

In this case T will be chosen to be a constant arbitrarily close to Tq. 

The allowed deviation of each variable will be defined by one of the error functions 

gi{t) = and g 2 {t) = (1 + d{t))e^^n~^^^, 

where 

iF = 100 • max ( 1 + ) . 

o<t<T\ f{t) f{t)J 

It is clearly enough to prove the first part of Theorem 11.11 for c < 1, so from now on we will 
assume this is the case. 

Let us dehne Gi to be the event that all of the bounds below in Proposition I3.11 ai-fei hold for 
every pair of vertices u and v and for all indices j = 0,... ,i. This section is devoted to the proof of 
the following result, which is the key to proving that the variables follow the desired trajectories. 

Proposition 3.1. Fix some vertices u and v. Then, conditioned on Gj-i, each of the following 
bounds fails with probability at most n~^^. 

(a) Dy{j) G [d{tj) ± gl{tj))^/n (d) Yu,v{j) G {y{tj) ± g 2 {tj))^/n 

(b) F„{j) G {f{tj)±gi{tj))n (e) Zu,v{j) G {z{tj) ± g 2 {tj))n 

(c) Xu,v{j) < 50log7^ 

As a corollary, we obtain our main result. 
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Theorem 3.2. Suppose c < 1, and T < yjlogn and K are defined as above. Then the bounds in 
Promsition \3.1\f a)-fe) hold with high probability for all vertices u andv and for every j = 0,...,r-n^. 

Proof. It is easy to check that Go always holds. If Bj is the event that, conditioned on Gj-i, at least 
one of these bounds fails for j, then the failure probability is exactly A trivial union 

bound over all pairs of vertices and all equations in Proposition 13. 1 1 shows that P[.Sj] < 5n“®, hence 
another union bound over the indices gives P[uJfi:lBj] < n~^ = o(l). □ 

To prove Proposition EH we follow the strategy in [ 2 ] and analyze each random variable sepa¬ 
rately. Our plan is to use some martingale concentration inequalities to bound the probability of 
large deviation. However, since we cannot track the exact values of the expectations, only estimate 
them by some intervals, we will use two separate sequences to bound each variable: A submartingale 
to bound from below, and a supermartingale to bound from above. 

Recall that a stochastic process Aq, Ai,... is called a submartingale if E[Aj_|_i|Ai,..., Aj] > Aj 
for all i, and a supermartingale if E[Aj_|_i|Ai,..., Aj] < Aj for all i. We say that a sequence 
Aq, Ai, ... of variables is {p, A)-bounded if Aj — 77 < Aj+i < Aj -|- A for all i. We call a sequence of 
pairs X^,X^, ... an (rj, A)-bounded martingale pair, if A^, A^,... is an (ry, A)-bounded submartin¬ 
gale and XQ,Xfi, ... is an (ry, A)-bounded supermartingale. The following concentration results of 
Bohman [2] are essential for proving that the variables follow the desired trajectories: 

Lemma 3.3 (Bohman). Suppose rj < A/10 and a < rjm. If 0 = X^,X^,... is an {rj, N)-bounded 
martingale pair then 

2 2 
P[A/j < —a] < and P[A“ > a] < 

The general idea for analyzing a random variable R{i), representing any of the above five variables, 
is the following. In step i, an open triple is sampled, and thus with probability p a new edge is added 
to our graph. We split the one-step change in R{i) into two non-negative variables: Aj is the gain 
and Cj is the loss in step i, so R{j) = R( 0 ) -|- — Ci- The gain comes from the contribution 

of the added edge after a successful sample. Loss can only occur when some open triple stops being 
open, either because it was sampled or because its missing edge was added through some other open 
triple (although the effect of the latter event is negligible compared to the former if the codegrees 
are small). 

Next we estimate the expectation of Aj (using the recurrence relations we hinted at in Section [2]), 
so that we can define Af and A~, shifted copies of Aj with non-negative and non-positive expec¬ 
tations, respectively. This way = Yll=i is an (ry, A)-bounded martingale pair, where ry is 
approximately the expectation and A is some trivial upper bound on Aj. We do the same with the 
Ci to define and the martingale pair = Yli=i 

Finally we establish a connection between the concentration of R{j) = R{0) -I ~ 

the concentration of our shifted variables and in Lemma (3.71 and then use the concentration 
of martingale pairs. Lemma 13.31 to bound the error probabilities in Corollary 13.81 

The rest of this section is devoted to the actual calculations. The reader might want to skip the 
details at a first reading. The first subsection establishes the tools we use to prove concentration, 
while the remaining five subsections prove one-by-one the five parts of Proposition 13.11 
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3.1 Tools 


The following claim will help us clean up the calculations of the expectations. Recall that K = 
100 • maxo<t<T (l + . 

Claim 3.4. Let 0 < t <T so that f{t) > 0 is bounded away from 0 (t might depend on n). If r{t) 
is one of the functions 1, d{f) or f{t) then 


(r(t) ± gi{t)){f{t) ± gi{t)) (l + 0(^)) 
f{t)±gi{t) 

{r{t)±g^my{t)±92{t)){l + OC-^)) 

f{t)±gi{t) 

{z{t)±g 2 {t)) + 

f{t)±gi{t) 




C r{t)d{t) ± ^92{t) 




Proof. Straightforward calculus shows that 


f{t)±gi{t) \f{t) Pit) \Pit)JJ 


and 


and also 


Using this, we will multiply out the formulas on the left-hand side of the inequalities. Note that giit) 
and g 2 {t) are both so in the expanded formulas, any term containing two factors of the 

type gait) or a factor of jg consumed by an Oin~‘^l'^) error term. Hence the left-hand 

side of the first inequality is contained in 

(r(t) ± giit))ifit) ± giit)) ± -h 

^ ± + C>(n"^/^) C r(t) ± ^9iit)- 


Similarly, the left-hand side of the second inequality is contained in 

. log^ n 


(r(t) ± giit))iyit) ± g 2 it)) 1 + 0( 


n 


1 ^ 9iit) 


fit) Pit) 


+ 0(n 


C rit)dit) ± 


( rjt) 

[fit) ' fit)il+dit)) ' Pit)il + dit)) 


+ 


hit) 


+ 


rit)yit) 


K 


52(t) + 0(n C rit)dit) ± —92it) 


20^ 


using yit) = /(t)(i(t) and g 2 it) = (1 -|- dit))giit). Finally, the left-hand side of the last inequality is 
contained in 

= ^iw)^ Pim+dm ) ^ ^ 

using zit) = Pit). □ 



The remaining lemmas connect the concentration of the original variables and those shifted by 
the expectations. We will use the following observations in the calculations. 

Claim 3.5. Let s{t) be a differentiable function on [0,T] such that supjg[o,T] l•s^(^)l = O{polylog n) 
and ti = \. Then 


1 




i-i 




slr^dr^Oin ^). 


i=0 


10 


Proof. It is a well-known fact in numerical analysis that for reals a < q <b 

rb 


s{T)dT — {b — a)s{q) 


< {b — a)^ sup |s^(t)|. 


Taking a = q = ti with b = tj+i and using tj+i — U = \, this gives 


rL+i i.(fr 

s{r)dr - ^ 




< 


ti 




sup |s'(t)|, 




i+lj 


and summing these up for i = 0 ,... ,j — 1 , we get 


1 

s{T)dT -^ > S{ti 


i=0 


< 


tj ■ sup^gjo^t^] \s'{t)\ /polylog 


= O 


n 




V 




< 0{n 


-T 


□ 


This claim will be applied when s is one of the functions d', f ,x' ,y', z', g'^ and g 2 , in which case 
s' is indeed bounded by 0 (polylog n) in the interval [ 0 , Vlog n]. 


Claim 3.6. For a G {1,2} we have 

9 a{r)dT < ^{ga{t) - 

Proof. Note that ga{t) = (p{t)e^'n~^^^, where ip{t) is either constant 1 or 1 -|- d{t). In both cases, 
(^( 0 ) = 1 and (p'{t) > 0 for t > 0 , so 

= {T'{t)e^^lK > ga{t). 

Hence 

^ ga{T)dT < = ^{gait) - 

as required. □ 

It is time to formally define the shifted variables. Recall that if R{i) represents one of our random 
variables, then we use the non-negative variables Ai and Ci for the one-step increase and decrease 
in R, respectively, so that R{i) — R{i — 1) = T* — C*. Our aim is to show that R{i) is approximately 
n'^rfti) for some real 7 , where the error (the allowed fluctuation of R) is bounded by n'^gaiU) for 
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some a G {1, 2}. Here our choice of 7 and a depends on the variable R represents: 7 will be 1 for F 
and Z, 1/2 for D and Y, and 0 for X, while a will be 1 for D and F, and 2 for X, Y and Z. 

To show the concentration of R, we approximate Ai and Ct by their expectations, which, as 
we shall prove, lie in the intervals ± 

respectively, for some appropriately chosen functions VA^t) and rc{t). Thus we can define the shifted 
variables Af and Cf having non-negative expectation, as well as A~ and C~ having non-positive 
expectation as follows: 

K 

Af = Ai- rY~‘^{rA{ti-i) =F —gaiU-i)) with Bf = '^Af and 

i=l 

K J 

cf = Ci- rF~‘^{rc{ti-i) =F —gaiU-i)) with Of = ^ Cf. 

i=l 

Lemma 3.7. Suppose the variable R satisfies R{j) = 'nZ+ Yl,i=i where r is a polynomial 

in t such that r'{t) = r^(t) — rc{t). Then 

R{j) < n'^{r{tj) + ga{tj)) - I2 F BJ - Df and 

R{j) > n^(r(t,) - g^{tfi) + /2 + B+ - DJ. 

Proof. Let us first consider the upper bound: 
j 

R[j) = n^r(O) + y^^Ai - Cj 

i=l 

= n^r(O) -h (a- + n'^-^{rA{ti-i) + ^ga{ti-i)) \ - Y (^f + n'^~^{rc{ti-i) - ^ga{ti-i)) 

i=l ^ ^ i=l^ 

j j 

= B~ - Df + n^r(O) - rc(tj-i)) + Y KgaiU-i) 

2=1 2=1 

Now we apply Claim 1531 with functions r^(t) — rc{t) (a polynomial) and Kga{t) (a product of a 
polynomial and an exponential function). As T < -y/logn, their derivatives are clearly bounded by 
0(polylog n) on [0,T]. 

R{j) < B~ - Df + n'^r(O) J - rc(r))dT +J B:ga{T)dT + 0{n~^) 

< n\r{tj) + 5 a(ti)) - n^-^/^/2 + Bj - Df 

using Claim iTBl and + 0{n~f > n'^“^/®/2 in the last step. 

The lower bound comes from an analogous argument by changing the appropriate signs. □ 

Using this, we can estimate the probability that R{j) deviates from its expectation: 

Corollary 3.8. Suppose the numbers 7,0 and the functions R,r,rA,rc satisfy the conditions of 
Lemma \377[ Suppose furthermore that Bf and Df are {gi, Ni)-bounded and {^ 2 , Af 2 )-bounded mar¬ 
tingale pairs, respectively, where ggNg < e and gp < X^/10 for fi = 1,2. Then the probability that 

n2r-l/3 

R{j) 0 n'^{r{tj) F ga{tj)) is at most 4e . 
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Proof. Lemma rS. 71 shows that R{j) > n'^{r{tj)+ga{tj)) implies n'^ I2 < —D^, hence this event 

is contained in the union of the events < B~ and —. A straightforward 

n2T-l/3 

application of Lemma 13.31 then gives a bound of e the probability of each event, thus 

n27-l/3 

R{j) > rB{r{tj) + ga{tj)) occurs with probability at most 2e . A similar argument using 

the other inequality of Lemma 13.71 gives the same bound on the probability of the event R{j) < 
rB{r{tj) — ga{tj)), finishing the proof. □ 


3.2 Degrees 

Recall that in this section, and also in the next four sections, we assume Qj-i holds, i.e., the values 
of D.I,, Fy, Xu,v,yu,v and Zy^v are all in the prescribed intervals during the hrst j — 1 steps. 


Proof of Proposition \3.1\f a). Let Aj be the indicator random variable of the event that an open triple 
at V was successfully sampled in step i. Then Dy{j) = A*. The probability that Aj+i = 1 is 


‘^pFyji) 2c{f{ti)±gi{ti)) ,K -i/6 

E.,FUi)^ n^/^ifiU)±g,{U)) - \ 2 


using Claim [331 
Set 


At=Ai- 


n 


3/2 



Kti-i 


n 



and = 

i=l 


then B^ is a l)-bounded martingale pair. So if we define Cj and rc{t) to be 0 for all i, then 

all the conditions of Corollary 13.81 are satisfied with the choice of r^(t) = 2c, r{t) = d{t), 7 = 1/2, 

a = 1 and e = 3n“^/^. Hence the probability that R{j) = Dy{j) is not in y/n{d{tj) ± gi{tj)) is less 
-„l/6 

than 4e < n~^^, using 150j < 150n^^/log n < n^logn. □ 


3.3 Open triples 


Proof of Provosition \FTV h). Here we break the one-step change in Fy{i) into two parts: Aj will be 
the gain in the open triples at v caused by the Lth sample and Ci will be the loss, so that we can 
write Fy{j) = n-l + A* - Ci. 

We may lose a particular open triple uwv in two different ways: either if we sample it, or if 
we successfully sample another open triple with the same missing edge vu. There are at most 
Xu,v < 50 log n candidates for this other triple and a successful sample has probability!? = 0(l/v^), 
so the linearity of expectation gives 


E[Ci+i] = 

using Claim 1331 
Set 


2F,m + 0(i^)) 2(/(t.) ± gi(f,))(l + 0(:^)) _ 1 




G 


n{f{ti) ± gi{ti)) 


n 


C i ( 2±—e^^^n-F^ 


= Ci- 1 ( 2 ^ 


n 




2=1 
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and 












then Dq^D^,... is a (^, 50 log n)-bounded martingale pair, because one sample can only “break” 
the open triples with the same missing edge, and there are at most codegree-many of them. 

On the other hand, as we have already mentioned in Section [21 there are two ways to obtain new 
open triples at v. The contribution of a new edge vu touching v in step i -|- 1 is Du{i) — Xu^v{i) 
because it creates an open triple at v with any edge of u except if the third edge is already there. 
Alternatively, a new edge incident to a neighbor u of u creates a new open triple unless it connects 
to another neighbor of v. There are at most Yluu'eD„{i) Xu,u'ii) open triples that could create an 
edge between two neighbors of u, so 


E[A,+i] 


2p 


E 

wuGFy{i) 



+ 


2p 




E 

u&Dy{i) 


E 0{X^y{t)) 

u,u'GD-u(i) 


Note how we abuse our notation to also think of the quantities Dy(i) and Fy{i) as the set they count. 
So u G Dy{i) should be understood as a neighbor of v and wu G Fy{i) refers to an open triple vwu. 
Using Claim [33] we get 


E[Aj_|_i] C 


4c{d{ti) ± gi{ti)){f{ti) ± - 0(^)) 


nifiti)±giiti)) 




n V 


This means that for 


Af = Ai-^ UcdiU-i) T and Bf = E 

^ '' 2 = 1 

... is a martingale pair. 

Next, we show that it is a -yTilogn)-bounded martingale pair. Indeed, adding an edge 

vw in step i + 1 can increase the number of open triples at v by at most Ai < Dy,{i) whereas an 
edge ww' not touching v can only increase it by one. The upper bound then comes from Dyy{i) = 
0{y/n log n) < y/nlogn and Ai > A^. On the other hand, is smallest when Ai = 0. Observing 
that 4cd{t) < 8cPy/logn we see that the change is bounded from below by (— logn/n). 

Therefore we can apply Corollary 13.81 with r^(t) = 4c(i(t), rc{t) = 2, r{t) = f{t), 7 = 1, a = 1, 
and £ = log^ nj^fn to show that the probability that = Fy[j) + 1 (or Fy{j)) is not in the interval 
n[f{tj) ± gi{tj)) is at most □ 


3.4 3-walks 


Proof of Propositiori. [SAY d). Once again, we break the one-step change in Yu^y{i) into two parts: Ai 
will be the gain in the open 3-walks from u to v caused by the i’th sample and Ci will be the loss, 
so we can write Yy^y{j) = EEi Ai-Ci. 

We lose a particular 3-walk uww'v either if we sample its open triple uww', or if we add the 
missing edge uw' by successfully sampling some other triple (as before, the latter event is unlikely 
since the codegrees are small). Then the linearity of expectation and Claim [331 gives 


nci+i] 


‘^Xu,vii){4 + 0{ ^)) 2{y{t) ± g 2 {t)){l + 0{ ^ i 
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So defining 

T f 52(t*-i)) and Cf, 

^ '' 2 = 1 

we get that 0^,0 ^,... is a (^^,501ogn)-bounded martingale pair. 

Now let us look at ^i+i, the number of ways a new open 3-walk uww'v can be created in step 
i + 1. We follow the analysis described in Section [2j If uw is the new edge, then we need to count 
the 4-walks utww'v in Gi where w' is not u or a neighbor of u, and utw is open. Let N be the set of 
such candidates for w', then |A^| = D^{i) — O(logn), and the expected contribution to ^j+i of this 
type is 


'Er^rii) n3/2(/(t) ± C/i(t)) “713/2^ f{t) 6 7 

Strictly speaking, we are using the linearity of expectation over the indicator variables for each fixed 
3-walk uww'v. The probability that this walk is created is the number of t’s such that utww' is an 
open 3-walk in Gi, divided by the number of open triples. 

We similarly get that the expected contribution where ww' is the new edge is 


Ew'&N 1 

Er^r{i) n3/2 


( ‘^cd{t)y{t) 

V fit) 


± f 92(i)) 


whereas new open 3-walks where w'v is the new edge come from open 4-walks uww'tv in Gi, so the 
expected contribution of this type is 


‘^pZuAi) 2c{z{t) ± g 2 it)) J_ f 2cz{t) K \ 
ErFri^) n3/2(/(t)±5i(t) “ n3/2 ^ /(t) 6^"^ >) ' 


Putting all of these together, we see that for 


= Ai - 


n3/2 


/ 2c(2d(ti-i)y(ti-i) + z(ti_i)) ,K '' 

- Jit-;) -±y»2(*,-.) 


= Ei=i^f i® ^ martingale pair. In fact it is C ‘^/2 , 50 log n)-bounded, since a new edge can 
contribute at most codegree-many new 3-walks. 

Now we can apply Corollary 13.81 with r^(f) = 2c(2(i(f)y(t) -|- z{t))/f{t), rc{t) = 2y{t)/f{t), 
r{t) = y{t) (recall the differential equation that y satisfies to see that r' = va — rc), 7 = 1 / 2 , 
a = 2 and e = log'^n/n3/2 to show that the probability that Rij) = Yu^y{j) is not in the interval 
y/niy{tj) ± g 2 itj)) is at most □ 


3.5 4-walks 

Proof of Proposition \3.1]f e }. This time we define Ai to be the number of new open 4-walks created in 
step i and Gi to be the number of open 4-walks we lose in step i, so that Zu,vij) = '^~‘2^+El=i ^i~Ci. 
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Once again, we lose an open 4-walk uww'w"v if one of its open triples uww' or w'w"v is sampled, 
or if one of their missing edges uw' or w'v is added through a successful sample of a different open 
triple. Hence we get, using Claim [3~i] 


E[C,+i] = 
So defining 




(,)(! +0(1^)) 4(z(f)±g2(t))(l + 0(^)) 1 U,it) K \ 

^ - n\f{t) ^ ^ J ' 


cr = Ci- 


1 / 4:Z{ti-l) K 


n 




and Df = ^Cf, 


2=1 


we get that D^, Df, ... is a ( ^ , 2500 log^ n)-bounded martingale pair, because an added edge of 
the form uw' or w'v can ruin at most Xu^w' ' X^/^y < (50 log n)^ open 4-paths. 

On the other hand, the analysis in Section [2] shows that a new open 4-walk uww'w"v can be 
created in four different ways, based on which one of the four edges was added in step f -|- 1. In the 
case when uw is the new edge, we need to count the 5-walks utww'w"v where utw and vw"w' are 
open, and w' is not u or a neighbor of u. Let M be the set of such edges w'w" for fixed u and v. 
Then 

|M| = Fy{i) - Yy^yii) - 0(Xy,y(i)) = Fy (i) - O ( ^ U lOg''^ U) , 
hence the expected contribution in this case is 




w'w"&M ^u,w 




,(f) ^ 2c(/(t) ± gi{t) + 0(^)) {y{t) ± g 2 {t)) ^ i ^ 2cf{t)y{t) ^ 


n{fit)±gi{t)) 


V fit) 


But the remaining three cases are essentially the same, we only need to switch u and v or the 
two indices of the variables Y. This means that 


n ( m ' I ’ 


so we can define 


-d Bf=±Ar. 


4=*= = 4 • — 

* * n V fiU-i) 


where Bq, Bf ,... is a 3d/nlog^ n)-bounded martingale pair. This is because a new edge of 

the form uw can add at most Dy{i)Xyy^yjii{i) = 0{^/n\o^t‘^ n) < yTilog^n new 4-walks and the 
same bound works for an edge touching u, whereas a new edge not touching u and v creates at most 
100 log n open 4-walks: at most codegree-many in both of the positions ww' and w'w". 

Now we apply Corollary 13.81 with r^(t) = 8cf{t)y{t)/f{t), rc(t) = 4z(t)//(t), r{t) = z{t) (the 
differential equation for z implies r' = r a — re), j = 1, a = 2 and £ = log® n/y/n to show that the 
probability that i?(j) = n + Zy^vij) is not in the interval niz{tj) ± 52 (ij)) is at most 4e < 

n- 10 . □ 
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3.6 Codegrees 

Proof of Proposition, \S. iV c). Let Ai = — ^u,v(i — 1) be the increase in the codegree of u and 

u in a step so that Xu^v{j) = 1 + ^ 11=1 to see that dj+i is the indicator random variable 

of the event that the open triple of an open 3-walk from u to u or from r to u is successfully sampled 
in step i -|- 1. The probability of this event is 

‘2^p{yu,v{i) Ac{y{t) ± g 2 {t)) ^ 1 /4cy(t) K \ 

E. i4(i) n2(/(t) ± g,{t)) - n2 V fit) 2 V ■ 

So if we set A~ = At — ^ T52(ti-i)^ then B~ = Ei=i i® ^ supermartingale 

and it is l)-bounded. Now we can apply Lemma [3171 with 7 = 0, a = 2, r^(t) = = 

4:cd{t) = 8c^t, rcit) = 0 and r{t) = -|- 1 to Rif) = Xu,v{j)- Then the first inequality gives 

Xu,vij) ^ 1 + ^^t^j + 52(tj) + B- . 

Therefore (keeping in mind that tj < ^/logn and c < 1) we see that if Xu,vij) > 50logn then 
BJ > 25 log n. But by Lemma 13.31 this has probability at most 

25^log^ n 

g 30 log ri g 10 log 71. _ ^ 10 

for any j < n^y^log n, finishing our claim. □ 

4 The second phase 

In this section we analyze the second phase of the process and prove our main result, the lower and 
upper bounds on the threshold probability. Unlike in the first phase, where we made one step at a 
time, here we expose triples in rounds. In a round we simultaneously sample all the currently open 
triples, and then add the edges accordingly. 

Let us adapt our notation to the second phase as follows. From now on f = 0,1,... will 

denote the degree of the vertex v after i rounds in the second phase. For example, T*«(0) is the 
degree of v at the end of the first phase, i.e., DviTn^) with the old notation. We similarly re-define 
the other variables F^j, Xu^v^yu,v and Zu^v, and let Gi denote the graph after the i’th round. 

We will make use of the following Chernoff-type inequalities (see, e.g., m)- 

Claim 4.1. Let X ~ Bin{n^p) he a binomial random variable. Then 

P 

1. P [X > np + a] < e and 

-jp. 

2. P [X < np — a] < e _ 
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4.1 The lower bound 

Suppose c < ^ is some fixed constant. Before we start the second phase, we need to decide how 
many steps the hrst phase should take. Recall that f{t) has a root at Tq = and that it 

is monotone decreasing in the interval [0, Tq]. It is easy to check that d(TQ) < 1, so fix a positive 
constant 5 < 1 — d(To) and choose e > 0 so that < 6. We define the stopping time T to be in 
the interval [0,To] so that f{T) = e/2. Hence if we apply Theorem 18.21 with this T, we get that after 
Tn^ steps 

• Dy{0) < {d{T) + gi{T))y/n < (1 - S)^/n and 

• Tu(0) < (e/2+ 5i(r))n < en 

for every vertex v. At this point, we move on to the second phase of the process. 

The plan is to show that the second phase ends in O(logn) rounds, while all the degrees stay 
below ^/n. This would imply that the final graph has at most riy/n edges, in particular, it is not 
complete. The following statement bounds the degrees of the vertices in the first O(logn) rounds. 
Showing that in the meantime the second phase gets stuck will be an easy corollary. 

Claim 4.2. Let m = 41og;^/2c’^- Then, with high probability, Dy(i) < y/n for every vertex v and 
0 < i < m. 


Proof. We will prove by induction that with high probability 

• Dy{i) < {l — 5 + {1 + 2c + ... + (2c)*“^)ce + i ■ y/n and 

• Fv{i) < ((2c)*e + 2n“^/®)n 


hold for every vertex v and 1 < i < m. Note that, by our choice of e, the bound on the degrees is 
less than — <5 + + i ■ ^/n < ^Jn. 

To proceed with the induction, we condition on the event that the bounds hold for i and then 
estimate the probability that they fail for z + 1 for some vertex v. 

First we show that the degree of each vertex increases by at most ((2c)*ce + ^/n in round 

z + 1. Indeed, the number of new edges that touch the vertex v is stochastically dominated by the 
binomial distribution Bin (^Fy{i), . Hence, by the hrst Chernoff-bound in Claim ITTl 


P 


Dy(i + 1) 


Dy{i) > Fy{i)-^ + (1 - 2c)n^/^ 
\/n 




so a union bound over all the vertices shows that the hrst bound fails in round z +1 with probability 
at most 

The second inequality follows from the hrst one by an easy counting argument. Since we sample 
all the current open triples every round, the ones counted in Fy{i + 1) are all new triples, i.e., they 
contain at least one new edge added in round z + 1. Now an open triple either has a new edge incident 
to V or not. If it does, we can choose it in at most ((2c)*ce + y/n ways, and then extend each 
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choice in at most ^/n ways to get a triple (as all degrees are below y/n). If not, then we first choose 
a neighbor of v and then a new incident edge. Consequently, the total number of open triples at v is 

+ 1) < 2 • ^(2c)®ce + y/n ■ y/n = ((2c)*''“^e + 2n~^^^)n. 

Taking a union bound over all the m rounds then completes the proof. □ 

Corollary 4.3. Let Q{i) he the total number of open triples after i rounds. Then Q{m) = 0 with 
high probability. 

Proof. If a triple is open after the i’th round, then it contains at least one new edge. Of course, the 
number of open triples containing some fixed new edge uv is at most D^{i) + Du{i) < ‘2'y/n, whp. On 
the other hand, the number of new edges cannot exceed the number of positive samples in the i’th 
round, distributed as Bin((5(i — 1), -^). Putting these together, this means that Q(0),..., Q{m) is 
a sequence of random variables where Q{i) is stochastically dominated by 2y/n ■ Bin(Q(i — 1), -^). 
In particular, 

E[g(z)] = E[E[g(z)|g(z - l)]] < E[2cQ(i - 1)] = 2cE[Q(i - 1)]. 

Using g(0) < n^, a simple application of Markov’s inequality gives 

P[g(m) > 0] < E[g(m)] < (2c)™Q(0) < = o(l). 


□ 

Proof of Theorem \l.ll part 2. Corollary 14.31 shows that whp the process runs out of open triples after 
at most m rounds in the second phase. According to Claim [521 this final stage all vertices have 

3/2 

degree at most y/n, i.e., the graph has at most edges whp. □ 

4.2 The upper bound 

Suppose ^ < c < 1 is fixed. Then we can run the first phase all the way, for n^y/logn steps. Indeed, 
as the function f{t) has a global minimum of / (^) = 1 — ^ > 0, we can apply Theorem 13.21 with 
stopping time T = -^log n. 

Our plan is to give rapidly increasing lower bounds on the degrees and codegrees as the graph 
evolves, thus showing that we reach the complete graph in O(loglogn) rounds. Let us analyze the 
first round separately. 

The initial parameters of the second phase are, as implied by Theorem 13.21 

• Xu,v{0) < 50logn, 

• Zu,v{^) = lOc^relog^ n + 0 (nlog^^^ n) > 2 c^nlog^ n. 
for any vertices u and v. 

Lemma 4.4. There is some constant 7 > 0, such that the codegree Xu,vi^) > 7 log^ n for every pair 
of vertices u, v with high probability. 
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Proof. Fix u and v. We expect most of their new common neighbors to be vertices w with open 
triples to both u and v. So if denotes the number of open triples pqr, then we want many 
vertices w such that both and X^^^ are relatively large. 

Claim 4.5. For every pair of vertices u, v, there are at least a-n vertices w such that > 

4 4 

61ogn, where a = ^fgo ^ ~ fo positive constants. 

Proof. Note that an open 4-walk is just a sequence of two open triples, hence 

2 c^nlog^n < Zu,v{0) = ^ Xu,w{0) ■ 

Here each summand is bounded by (50 log n)^, so if fewer than an vertices w satisfy c^log^n < 
W,;,«)(0), then the right hand sum above is less than log^ n-n+{50 log n)^-an = 2 c'^nlog^ n, 
a contradiction. At the same time, the bound on the codegrees implies that each w with log^ n < 
-^u,«)( 0 ) • satisfies our requirements. □ 

Now if some w shares at least ftlogn open triples with both u and v, then it becomes a new 

( / \ b log n 

^ ~ V ~ ^ j 

(^IvF^) later in this section we use that (1 — a)^ < 1 — a/3/2 for all a/3 < 1). These 

events are independent for different w’s, hence Xu,v is bounded from below by the Binomial random 
variable Bin ^an, ^ 4 °^ • Then by Lemma Fd.ll the probability that Xu^v{^) is smaller than 7 log^ n 

is for a sufficiently small 7 . A union bound over all pairs of vertices finishes the proof. □ 

To make our life easier, we consider a slightly different second phase from this point on. Instead 
of sampling open triples with success probability p, we will consider a sprinkling process, and sample 
all triples with success probability in each round (starting from round 2). This means that 

some triples will have a higher than p chance to exist, but as long as the number of rounds m 
is O(log log n), the effect is negligible: each triple is still sampled with probability at most ■ 

Formally we can say that we are proving the result for any constant P > c. 

To give a lower bound on the codegrees in Gj+i, we define the following sequence: 

Xj = 7 log n, ^ = l,...,m 



with Xm+i = 75 , where we choose m = O(log log n) to be smallest possible such that Xm > logn. 
Let us also set pj = 1 — 1- , 7 

\ \/n log n J 


Lemma 4.6. With high probability 


Xu,v{,i) Pi ^2 


for all 1 < i < m + 1 and all pairs of vertices u and v. 
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Proof. Lemma 14.41 shows that the lower bound on the codegrees holds for i = 1, so assume 2 < i. 
We condition on the event that the statement holds for i — 1 and bound the probability that it fails 
for i. 

We claim that under these conditions G{n,pi) is a subgraph of Gj. To see this, observe that if 
an edge uv is missing from Gi-i, then it has Xu^y{i — 1) > Xi-i independent chances of probability 
\/n logn of being added in the z’th round. Moreover, these events are independent for the different 
non-edges, as the triples are sampled independently, and each triple has at most one missing edge. 
This means that missing edges are added independently with probability at least pi while existing 
edges are kept in the graph, thus indeed G{n,pi) C G*. 

We intend to use the Chernoff bound to show that all the codegrees in G{n,pi), and thus also 
in Gj, exceed x*. For this, observe that the codegree of any fixed pair of vertices in G{n,pi) is a 
binomial random variable Ri ~ Bin(n — 2,pf). A straightforward calculation gives > 2xj. 

Indeed, for 2 < i < m we have 


(n 




4 

^/nTo^ 



(n - 2) • 
nlogn 


log n 


= 2xj. 


whereas for i 


m + l (using Xm > \^/n logn), 


(n 



1 - ^ \ 
Vnlogn J 


2 

> (n 


2 )(l-l/e)2 > 2 x^+ 1 , 


Thus, as Xj > (ilog^n. Claim HTl shows that P[i?i < x*] = e Now taking the union bound 

over all vertex pairs and over all i finishes the proof. □ 


Proof of Theorem M.R part 1. We claim that Gm +2 is the complete graph. Indeed, Lemma [4. 61 shows 
that whp all the codegrees in Gm-i-i are linear, so the probability that a fixed edge is missing from 
Gm +2 is at most (1 — 4 ,/A union bound over all pairs of vertices then 
completes the proof. □ 


5 Concluding remarks 

Probably the most natural question that one can ask is the following. What happens if the process 
starts with some other tree, and not the star? Intuitively it seems that we are in a worse situation as 
there are fewer open triples to start with. We would therefore expect that if p < then starting 
with any fixed tree, the triadic process fails to propagate whp. In fact, we believe that whp this 
holds for all trees simultaneously. 

Using the topology language, this is equivalent to saying that p = jg the threshold prob- 
ability for a random 2-complex to contain a collapsible hypertree (the upper bound comes from 
Corollary We must note that a complex can have trivial fundamental group without actu¬ 

ally containing a collapsible hypertree. A yet stronger question would be to ask for a lower bound 
matching the bound on the threshold in Corollary 11.21 for being simply connected. 

Going in a different direction, it would also be interesting to study similar processes that are 
perhaps more meaningful from the social networks point of view. For example, a triadic process 
where vertices are discouraged to reach high degrees could be a more realistic model. 
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Note added in proof. After this paper was written, we learned that Gundert and Wagner [7] 
independently improved the bound in [T], and showed that Y2{n,p) is whp simply connected for 
p = where c is a sufficiently large constant. 
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